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FLEXIBLE EXTREME VALUE INFERENCE AND HILL 
PLOTS FOR SMALL, MID AND LARGE SAMPLES 

By Pavlina Jordanova^^ and Milan Stehlik^, 

Shumen University^ and Universidad Tecnica Federico Santa Maria^ 

Asymptotic normality of extreme value tail estimators received 
much attention in the literature, giving rise to increasingly compli¬ 
cated 2nd order regularity conditioirs. However, such conditions are 
really difficult to be checked for real data. Especially it is difficult or 
impossible to check such conditions using small samples. Beside that 
most of those conditions suffer from the drawback of a potentially sin¬ 
gular integral representations. However, we can have various orders of 
approximation by normal distributions, e.g. Berry-Esseen Types and 
Edgeworth types. In this paper we indicate that for Berry-Esseen 
Types of normal approximation and related asymptotic normality of 
generalized Hill estimators, we do not necessarily need 2nd order reg¬ 
ularity conditions and we can apply only Karamata’s representation 
for regularly varying tails. 2nd order regularity conditions however 
better relates to Edgeworth types of normal approximations, albeit 
requiring larger data samples for their proper check. Finally both ex¬ 
pansions are prone for bootstrap and other subsampling techniques. 

All existing results indicate that proper representation of tail behav¬ 
ior play a special and somewhat intriguing role in that context. We 
dispel that widespread opinion by providing a full characterization 
and representation, in a general regular variation context, of the in¬ 
tegral singularity phenomenon, highlighting its relation to an asymp¬ 
totical normality of the Generalized Hill estimator without the 2nd 
order condition. Thus application of this new methodology is simple 
and much more flexible, optimal for real data sets. Alternative and 
powerful versions of the Hill plot are also introduced and illustrated 
on ecological data of snow extremes from Slovakia. 


1. Introduction and preliminaries. Statistical Models for extreme 
value distributions have become increasingly popular in recent years, as they 
provide a much better fit for data presenting some departures from normal¬ 
ity. Statistical inference for extreme value distribution typically requires a 
2nd order regularity conditions, following works of [de Haan and Stadtmiiller 
(1996)], [de Haan and Ferreira (2006) ] and [Geluk et al. (1997)]. However, 
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to check 2nd order regularity conditions is difficult for a real data, despite 
effort of some recent papers, and this is also probably one of the main rea¬ 
sons why extreme value theory became highly complicated. As we illustrate 
in this paper, this complexity can be reduced significantly. Thus we sug¬ 
gest a more flexible approach to estimate extreme value index, which is also 
illustrated both on real and synthetic data. The newly introduced methodol¬ 
ogy is based on Karamata’s representation. Several works on representations 
have been published ([11], [12],[16], among others). We acknowledge also the 
developing of 2nd order condition methods, since we learned much from this 
methodology and, e.g. many comparisons based on such methods have ap¬ 
peared. In particular, Edgeworth expansion for the Hill estimator has been 
developed under 2nd order RV framework, see e.g. [4]. However, 2nd or¬ 
der condition does not necessary hold (see e.g. [16]) and therefore we work 
without this assumption. 

We denote by Ai, X 2 ,..., independent identically distributed (i.i.d) 
random variables (r.vs) with cumulative distribution function (c.d.f.) F such 
that F G RV-a with a > 0. The last means that there exist a positive limit 


limhH 

t^OO F{t) 


for all X > 0. It is known that it is equivalent to 


t^oo F{t) 


= X 


for X > 0 with some a > 0. The number —a is called the index of regular 
variation. 

Denote the corresponding increasing order statistics by 


A^(l,n) — A^(2,n) — — ^(n,n) 


and by 

( 1 . 1 ) 


Hx,k,n,p ■—, 


2=1 


^{n—i+l,n) ^ ^ 


A, 


(n—k,n) 


p G M, p / 0. 


The generalized Hill estimator (see [1] for original version with different 
and constrained parametrization) is dehned by 


'yx,k,n,p ■ — 


1 - 


P 


1 ^ 
rE 


2=1 


^(n—i+l,n) ^ ^ 


1 ~1N 


A, 


{n—k,n) 


P 


-1 

X,k,n,p 


p G M, P / 0. 








FLEXIBLE EXTREME VALUE INFERENCE AND HILL PLOTS 


3 


For p = 0 we consider limit, which is the well-known Hill estimator (see 
[14]), defined as 



In this paper we determine the exact distribution of Hx,k,n,p for all 
n G N, A: = 1,2, ...,n and Pareto distributed random variables. For any 
fixed k = 1 , 2 ,..., and n —)■ oo we find appropriate normalizations, with non- 
random centering, such that transformed Hx,k,n,p and jx,k,n,p are asymp¬ 
totically standard normal. More generally we show that for the case when 
the distribution of the observed random variable has regularly varying tail 
and, without using the second order regularly varying condition, the limiting 
distribution for n —)• oo and then A: —)• oo is again standard normal. 

The paper is organized as follows. First we recall important definitions 
from Extreme Value Theory and consider the relation between the classes of 
distributions that achieve asymptotic normality of the normalized Hill esti¬ 
mator and the second order regularly varying condition. We present several 
examples that show that these classes of distributions are not equivalent. 
In section 2 we consider the behavior of the Generalized Hill estimator, 
in case when the number of the order statistics is fixed and show that in 
Pareto case the distribution of Hx,k,n,p coincides with the average of specific 
powers of uniformly distributed random variables. For p = —a this distribu¬ 
tion is Irwin-Hall distribution. In comparison with the Hill estimator where 
this distribution is Gamma with parameters k and ka. In more general case, 
when the tail of the distribution of the observed random variable is regularly 
varying these distribution appear in corresponding limits, when the sample 
size increases unboundedly. In the end of the second section we show by 
simulations that also for one of the most difficult cases for estimation, when 
the distribution of the observed random variable has very slowly regularly 
varying tail, and more precisely when it is Hill horror distributed, the con¬ 
siderations for fixed sample of order statistics and their plot for increasing 
n are more informative than the Hill plots. The third section considers the 
cases when we obtain asymptotic normality of the Generalized Hill estima¬ 
tor and prove that in Pareto case we could achieve it also for the number of 
order statistics that is close to the sample size. In Theorem 3 we prove that 
the corresponding results about the Hill estimator could be considered as a 
particular case of the Generalized Hill estimator, therefore we call Jx,k,n,p 
in this way. In Section 4 we try to find ’’the most appropriate” case of p. It 
turns out that if we determine ’’the most appropriate” value in such a way 



4 


P. JORDANOVA AND M. STEHLIK 


that to achieve the smallest variance, this value is p = 0 and the correspond¬ 
ing ’’best” Generalized Hill estimator is just the Hill estimator. However if 
we consider the best value of p as the one that leads us to the fastest rate 
of convergence between the distribution functions and in the sense of the 
Berry-Esseen theorem, the best value of p is — I. 22 I /7 and the estimator 
does not fluctuates too much if we replace this value with some value close 
to it. In Section 5 we apply the last results on the real data example and 
show that in practice the difference between the last estimators is not so 
much important in case when the variance of the observed variable exists 
and it has Pareto tail with a > 2. Therefore the extreme value theory could 
be applied widely in practice. 

Through the paper we use the following notations: = is for the equality 
in distribution and -A for convergence in distribution. 

1.1. Karamata’s representation and singular integrals. In 1930 Kara- 
mata (see [15]) introduced the notion of regular variation and proved some 
fundamental theorems for regularly varying (RV) functions. Here we recall 
his representation theorem. 

Theorem 1.1. Karamata’s representation theorem A function U : 
(0, 00 ) —)• (0, 00 ) is regularly varying with index p iff U has the representation 

U{x) = c(x)exp( J t~^p{t)dt) 

where lima;^oo c{x) = c G (0, 00 ) and lim^^oo p{t) = P- 

Remark 1. One important issue which can clarify complexity of com¬ 
putations with concrete regularly varying tail distribution functions is the 
fact that albeit we have a Karamata’s representation theorem (which is only 
an existence theorem), not always the involved integrals are real and we can 
meet both complex valued functions and undefined integrals, so that not all 
forms of representation can be always applied. As an example may serve 
well Karamata’s representation for ln(x) (see e.g. [24]) whieh may need to 
eompute integral 

\n]c-i ~ ^ ^ _ / undefined, for e < t, 

X \ —vrl-|-ln(ln(t) — 1), otherwise, 

where e and I are Euler’s number and eomplex unit, respeetively. Karamata 
himself (see [17]) suggested extension of slow variation to analytic functions 
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defined on complex plane and this was done in [25]. This is important to 
note, since in this section we compute several limits using inverse function 
b[n/k) for n/k k, oo and for the sake of simplicity some complex represen¬ 
tations of real funetions will be used. In general the existence of sueh real 
inverses is related to the so called asymptotic inverse functions (see [3]). 

1.2. The second order regularly varying condition. In this section we clar¬ 
ify several issues about equivalence of asymptotic normality and 2 nd order 
condition formulated in Theorem 4.3, [10]. The following definition of the 
second order regular variation comes from [7], [ 8 ] and [10]. 

Definition 1. If the tail function of a non-negative random variable X 
is F := 1 — F and F : M —)• [0; 1] satisfies that F G RV-a with a > 0. 
Then F is said to he of seeond-order regular variation with parameter p < 
0, if there exists a function Aft) that ultimately has a eonstant sign with 
limt^oo A{t) = 0 and a constant c / 0 such that 

(1.3) lim -= Hap{x) = / u^~^du, x > 0 

t^QO Aft) 

Then it is written as F € 2RV-oi,p and A{t) is referred to as the auxiliary 
function of F. 


It is known from [7] or a more relevant form in Geluk et al. (1997) [14] 
that if Ha^p{x) is not a multiple of x~°‘ then p < Q implies that there 
exists a c 7 ^ 0 such that Ha,p{x) = and G RVp and no other 

choices of p are consistent with A{t) —?■ 0. There are many distributions 
which satisfy the second order RV condition. These are (see e.g. [ 6 ]): Cauchy 
7 = l ,/3 = — 2 , Frechet(l) j = I, p = —1, Student t(4) 7 = 1/4, p = — 1 / 2 , 
t(10) 7 = 1/10, p = —1/5 or loggamma 7 = 1/3, p = 0. 

The following theorem 4.3 of [10] claims, that suppose F G RV-a and 
Von Misses condition (1.4) holds then the asymptotic normality N{c,a'^), 
c 7 ^ 0 of Vk{Hx,k,nfi ~ 1 /a) is equivalent to the second order regularly 
varying condition (1.3). However, it turns out that under these conditions 
the asymptotic normality with c 7 ^ 0 is not equivalent to the 2 nd order 
regularly varying condition. 


Theorem 1.2. (Theorem 1.2 of [10]) Suppose 1 — F £ RV-a and 
that the Von Misses condition holds: F has density F' satisfying 


(1.4) 


lim 

x—¥oo 


xF'{x) 

n^) 


= a. 
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Then 1 — F is second-order regularly varying iff for some 9 G [0,1] there 
exists a function U G RVg such that C/(i) —^ oo as t ^ oo and there exist 
non-zero constants c and a > 0 such that with k = \U (n)] we have 

'/k{lx,k,n,o - ^ N{c,a^). 


From the proof of this theorem it is clear that they determine c by the 
limit relation 


(1.5) 


c = 


limVkiy r (l-F(s))— 

^ Jb{n/k) ^ ® 


In the following examples we show that it is possible to exist a distribution 
that satisfy the conditions of this theorem and the second order regularly 
varying condition but not to exist subsequence of such that to obtain 

0 < c < oo. We conjecture that this discrepancy could be caused by non 
conformal integral representation for jn in [7]) page 384. Namely, between 
(1.8) and (1.9) in [7], page 384 we can see 


( 1 . 6 ) 


n 


7n = 


lU{n/k) 


In s dF{s) 


In = 


n r 

kJx, 


In s dFn{s) 


n r 
kJx, 


(n — k,n) 

In {s/dFfi{s) 


f -y 

-^{n — k,n) 

Let us have Paretofi, a) case, then U(x) = and 1 — F{x) = x~°‘, x > 
1,0; > 0. Now let us denote t := n/k > 1 Then, according to (1.6) we have 


(1.7) 


ln = t 


poo 

/ log(s)as“““^ 


ds = at 


1 + log(t) 1 + log(t) 


a^t 


a 


Bellow (1.9) they write 


n ON ^ d\ogU{s) 1 

(1-8) = w / -= - 

^ J n/k ^ ^ 

But, (1.8) does not equal to (1.7), and difference between both 

integral representations of 7 „ converge to oo for t = n/k ^ oo. 

We guess, that non-conformal integral representation could be caused 
by mismatch of Theorem 4.1 from [5], which is cited just before introducing 
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representation (1.6) in [7], since if we calculate the expression in [5] in Pareto 
case it gives a correct value, namely 


a*{t)=t / F{e^)ds = t 

J lnU{t) 



a a 


Thus in the following Examples 1-3 we will also show that c = oo is 
accompanied by infinite difference between above representations (1.6) and 
(1.7) for 7 „ when n/A: —)■ oo. We will denote this difference A„ := 7„(1.6) — 
7„(1.7), where 7n(1.6), 7„(1.7) denote the 7 „ from above representations 
(1.6) and (1.7), respectively. 


Example 1. (Hall/Weiss class) The most common example of F € 
RV-a, a > 0, p < 0, is the so called Hall/Weiss class of distributions 


(1.9) 


F{x) = 1 — 


1 + xP 
2x“ 


X > 1. 


Briefly we will denote this by F £ HW{a,p). 
F G 2RV-a,p because 


F{tx) 

~W) 


- 


tp 


l + tp 


{xP - 1 ), 


we can chose A{t) = and this function is always negative and regularly 
varying with parameter p. The distributions from this type also satisfy the 
Von Misses condition (l-f)- 


lim 

X^OO 


xF'{x) 

F{x) 


lim 

X^OO 


a + {a — p)xP 
I + xP 


= a. 


Now let us try to find subsequence o/ n G N and constant c 0 and 
c < oo, described in (1.5) such that to apply Theorem 1.2. 

Case 1: Hall/Weiss distribution F[W{1, —1) 

Let us consider F £ HW(1,-1), i.e. 


1 7 . I 

F(x) = l- ^ , x>l. 

We already mentioned that the tail of this distribution function belongs to 
RV-i U 2RVi^-i and it satisfies the Von Misses condition (I. 4 ). R is not 
difficult to calculate that 



x + 1 


x > 1 
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Therefore 


= p>i. 


Let us now find c in (1.5). 


c = 


Im / (l-i^(s))-) = Im ^ / 

n^oo o(n) Jb{n/o(n)) S a n^oo \0[n) Jb(n/o{n)) 


+ S 


-1 


2s2 


-ds — 1 


= lim \/o{n) 


n 


+ 


2 o(n) \b{n/o{n)) 2lfi{n/o{n)) 


- 1 


lim 

n^oo 


\fo{ 


n] 


1 + ^1 + n/o(n) [1 + 


- 1 


= lim -s/o(n) = oo. 


Therefore such a subsequence, k{n) mentioned in Theorem 1.2 does not exist. 

Now, let us compute the difference An := 7 n(l- 6 ) — 7 n( 1 . 7 ),i := n/k > 0 
We have 

2 In b{t) + 1 + 2b(t) + 2b{t) In bit) 

- 46(7 - 

^,(L7) = VmT) + 4-t 

o 

and finally lim„/fc^oo (7t(l-6) - 7t(l-7)) = oo. 

Case 2: Hall/Weiss distribution HW[2,-1) 

For F G HW[2,-1) we have 


F{x) = 1 - 


1 + a; ^ 
2x2 ’ 


X > 1. 


The tail of this c.d.f. belongs to RV-i U 2RV2,-i and it satisfies the Von 
Misses condition (1.4)- 



2x3 
X + 1 ’ 


X > 1 


Therefore for p > 0 

( 1 . 10 ) 



^54p + 6v^-6p3 + 81p2 


P 


^ 54p + 6-\/—6p3 + 81p2 


6 
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c = 


We have to compute 

lim \/(^(-^ [ (l-F(s)) —--) = Im f 

1^00 o{n) Jb(n/o(n)) S a n^oo \ o(n) 


b{n/o(n)) 


l/s+l 1 
2 


For further purpose, let us denote njoin) ;= t, we know njo{n) —oo for 
all o(n), thus we study t ^ oo. 

We have for all 


poo 

b{n/o{n)) > 0 : t 

Jb{t) 


1/s + 1 I ^ “1“ 36(t) 
-as = t- 


2s3 


I2h{tf 


( 1 . 11 ) 


^^2 _j_ V 54f-|-6v^6t^+81f^ 


+ 








+ 


^54t-|-6\/-6f^-l-8TF 
By using of the last expression (1.11) we have 


r * 

lim t / ———ds = 6 

n/o(n)^oo Jb(t) 2s 


1/s + l 


Thus, for any choice of k{n) = o(n) we obtain that 

_ 77 ^C)0 J „ 1 

lim (l-F(s)) -) =+oo 

o{n),k^oo,n^oo 0[n) Jb(n/o(n)) “ 


C = 


Now, let us compute the difference A„ := 7„(1.6) — 7n(1.7),t := n/k > 0 
We have 

^6 In b{t) + 2 + 3b{t) + 6b{t) In b{t) 

=‘- m? - 

For 7t(1.7), the integral was not able to write in a form of elementary func¬ 
tion, so we shall compute only the limit, necessary for limiting difference, 

T /I rrN 1- dlllbit) 1 

hm 7t(1.7) = hm t — - — = 

£—)-oo t^oo at 2 

and finally lim„/fc^oo An = limt^oo (7t(l-6) - 7t(l-7)) = oo. 

Case 3: Hall/Weiss distribution FIW{1,-2) 

For F £ FIW (1, —2) we have 


1 + X 


-2 


F{x) = 1 


2x 


X > 1. 
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As a particular case of Hall/Weiss distribution the tail of this c.d.f. belongs 
to RV-i U 2 RVi- 2 and it satisfies the Von Misses condition (1.4)- Let us 
now try to calculate c. 



2x^ 

+ 1 ’ 


X > 1 


Therefore for p > 1 



^ 54p + p^ + 6 


6 ^ 54p + + 6 81p^ + ^ 


We have to compute (1.5) 


c = 


+ ] 


lim / (l-F(s)) —--) = lim f 

1^00 o[n} Jb{n/o(n)} S a n^oo \o[n} Jb{n/o{n)) 25 ^ 

We have for all 

b(n/o(n)) >0: t 


S^ + I, l + 3b{ty 
ds = t- 


6b{ty 


( 1 . 12 ) 


t{l + 3( 


^ 54t+f3+6V'81t^+3t-^ 


+ 


6 Viii+tHWiTF+lF ® 


+ 1)^) 


^54t+t3+6V81t^+3f^ 


+ 


t2 


+ 1 ) 


6 ^54t+t3+6v/-8U2+jF ® 

57/ of the last expression (1.12) we have 


113 


lim t 


I 


°° s2 + l 


ds = 6 


n/o{n)^ao Jb(t) 

Thus, for any choice of k{n) = o(n) we obtain that 

, _ „ /-oo 

c= lim Y^o(n)(-^ / (1 - F(s)) 

k=o(n),k^oo,n^oo 0[n) Jb(n/o(n)) 


ds 1 , 

-= +00 

s a 


Now, let us compute the difference := 7„(1.6) — 7n(1.7),t := n/k > 0 
We have 

, 1 + 3b{ty In b{t) + 3 In b{t) + 36(i)^ 


7t(1.6) = t- 


6b{ty 


ds — 1 
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For 7t(1.7), the integral was not able to be written in a form of elementary 
funetion, so we shall eompute only limit, necessary for limiting difference, 

i.e. 


lim 7t(1.7) 

t—^OO 


lim t 
t —>-00 


d\nb{t) 

dt 


= 1 , 


and finally A„ = limt^oo (7t(l-6) - 7t(l-7)) = oo. 


Example 2. Log Erlang(2,1) 

We have p = Q, a = 1 (see [10]). 


F{x) = 1 - 


1 + In X 

X 


X > 1 


F^^^ l+ln(x) 

b(p)=f^] (p) = exp(-LW(-—) - 1), 

\F J ep 

where LW is the principal real valued branch of Lambert W function, see 
[21]. The slowly varying function in this case is L{x) = 1 + In x. We have 


r°° ds 

limt (l-F(s)) — 
Jbit) s 


lim t 
t—>-oo 



1 + ln(s) 


ds = 


= e lim 

)-oo 


-l + LW{-i) 


- 1 ' 


in'!., 


= OO 


Let us choose k = o(n), then we have 

lim j (1 - F{s))— - -) 

n^oo o[n) Jb(-P^) s a 

o{nj '' 


c = 


-= oo. 


Now, let us compute the difference := 7„(1.6) — 7n(1.7),t := n/k > 0 
We have 


7t(l-7) = - 




and finally lim^/^^^ A„ = limt^oo (7t(l-6) - 7t(l-7)) = oo. 
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Example 3. Slowly varying function satisfying 2nd order RV 
condition with p = 0 

F{x) = 1 — ex~^ ln(3:), x > e 


It is easy to show that this function satisfies 2nd order RV condition with 
p = 0 with A{t) = (ln(t))“^. We have 

1 ^ ^ X 
= {x) = , , ' 

eln(x) 


1 


b{p) = = ip) = -epLW{ -), 

\FJ ep 

here LW is the principal real valued branch of Lambert W function, see [21 ]. 
We have 

ds . /■°° eln(s) 


lim t 

£—)-oo 


r{1 - F{s))-= lim t [ 2 

Jb{t) 'S Jh{t) S 


-ds = 


= e lim t 


l + ln{-etLW{-jr)) 


-etLW{-^) 
Let us choose k = o{n), then we have 


= oo 


c = 


lim / (1 - F{s))— - -) 

1^00 o(n) " ) s a 

^ o{n)' 


-= OO. 


Now, let us compute the difference := 7„(1.6) — 7n(1.7),t := n/k > 0 
We have 

7^(16) = + i + 


7t(l-7) = - 


l-LlE(-i) + etLlE(-i) 


and finally lim^/^^^ A„ = limt^oo (7t(l-6) - 7t(l-7)) = oo. 


Moreover, F{x) = 1 — (x/(5)““, x > 6 that is Pareto{a,6) distribution, 
belongs to RV-a- It does not satisfy the second order regularly varying 
condition and we have c = 0. 

Further on we clarify the conditions that we need to impose in order to 
obtain asymptotic normality of the Generalized Hill and in particular of the 
Hill estimator. Some alternative approaches, albeit preliminary, could be find 
in [13] and [18]. Therefore we show that asymptotic normality is possible to 
be achieved without the second order regularly varying condition. 
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2. The limiting distribution of the Generalized Hill estimator 
for fixed number of order statistics. 


2.1. Pareto case. We start our investigations with the case when the 
observed random variable is Pareto distributed and find the exact distribu¬ 
tion of the Generalized Hill estimators for fixed number of order statistics k 
which is less than the sample size n. 


Proposition 1. If the c.d.f. F is Pareto(a,6), i.e. if 

then 

1 ^ 

( 2 . 2 ) 


1X,k,n,p 


A ^ 
P 



1 

k 



p GR, p 0. 


lx,k,n,o = ^ ^ In > 
i=l 

where Ui,U 2 , ■■■,Un are i.i.d. uniformly distributed r.v’s on (0,1). 

Proof: Let < U(2,n) ^ ••• ^ ^{n,n) be the upper order statistics of 

UuU2,...,Un. 

It is not difficult to check that 

(2.3) {1 - = l,2,...,n} = = l,2,...,n} 

and 

(2.4) / = 1,2,...,A:| = 1,2,...,A:} . 

L ^{k+l,n) J 

Recall, the probability quantile transform states that 

(2.5) = l,2,...,n} = {F^([/(i^„)),i = l,2,...,n} ,i = 1,2, 
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Therefore for F - Pareto 

(2.6) = l,2,...,n} = |5(1 - = 1,2, . 

For p ^ 0, the dehnition (1.1) and equalities (2.3) and (2.4) give 


Hx,k,n,p - 

2 = 1 




\ P 1 
(n— 2 +l,n) \ d ^ 


X, 


(n—k,n) 


-Y 

h 


2=1 


l-f/, 


(n— 2 +l,n) 


k 

k , k 


1-U> 


{n—k,n) 


k 

A ^ 

~k 2 -^ 


i=l 


u 


{i,n) 


u 


(fe+l,n) 




2 = 1 


2=1 


Analogously 


lX,k,nfi — 


E>» 


2=1 




X, 


(n —2 + 1,72) \ 1 

~k 


{n—k,n) 


El" T 


2=1 


1 — U(„_. 


(n— 2 +l,n) 


-u> 


{n—k,n) 


d 1 


^In 


2 = 1 


U 


(i,n) 


u 


(fc+l,n) 


A ^ 
~k 


Ei"U 


2=1 


{i,k) 


K 

^iyin ur 

h * 


i=l 


□ 


Note: In this case: 

1. the distribution of ^x,k,n,o is Gamma{k,ka). It is well known that it 
has mean l/a and variance l/{ka‘^). Therefore the larger the number of 
order statistics, the smaller the variance of the limiting distribution. 

2- Hx^k n-a is Irwin - Hall distributed. 


2.2. Regularly varying case. In the next statement we suppose that the 
sample size increases and do not suppose the exact Pareto distribution of 
the observed random variable, but only regularly varying tail of its distribu¬ 
tion function and obtain the same limit distribution of the Generalized Hill 
estimator for fixed number of order statistics, k. 


Proposition 2. Let Xi,X 2 , ..■,X„ he independent copies ofX. with d.f. 
F, 

(2.7) F e W_„. 

For fixed k, a and p < 0 and n —)• oo 

I ^ p 

Hx,k,n,p —> Yl ’ 


( 2 . 8 ) 
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'yX,k,n,p ^ 


d 1 


P 



p p / 0. 


(2.9) 


lX,k,nfl —^ ^ 



where Ui,U2,...,Ufc are i.i.d. uniformly distributed r.v’s on (0,1). 

Proof: The prove of (2.9) follow immediately from Theorem 1 in [13], 
the quantile transformation and the presentation of the Erlang distributed 
random variable as sum of i.i.d. exponentially distributed random variables. 
Let us now prove (2.8). 

Denote the distribution of X^,X 2 , ...,X^ by Fp, then 


Fp{x) = 1 - Fp{x) = P(X? >x) = P(Xi > xp) = F(xp). 


(2.7) imply that Fp{x) G 

By the probability quantile transformation, (2.3) and (2.7) we have that 
there exists a slowly varying function L such that 



= {Fp^(l - \J^n-^+l,n)),^ = l,2,...,n} 




The reciprocal of a uniformly distributed r.v. is a.s. greater than one. 
The Karamata-representation theorem for regularly varying functions entails 
that there exist measurable and bounded functions c{x), converging to a 
constant and e(x), converging to 0, when the argument is close to infinity, 
and B > 0, such that 



exp 


(/. 



dx \ ,i = 1,2,..., n 
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Consider Hx,k,n,p defined in (1.1). 


H 


X^k,n,p — 


E 




2=1 


U 


(i,n) 


' {k+l,n) 


( ) 


1 


U 


(fe + l,n) 


exp 


e{x) 


dx , 


where c{x) —>■ cq S (0, oo) as x —>• oo, e : R'^ —)• R'^ and e(t) —0 as t —>• oo. 
By (2.4), for n G N and A: = 1, 2,..., n, 


h 


U, 


(*,n) \ “ d 1 


If 

( 2 . 10 ) 


k^W 

1=1 


A„, := 


’ (k+l,n) 


2=1 2=1 


Hx,k,n,p , 


fc ^ vu 

2=1 


U 


(i,n) 


(fc+l,n) 


0 , 


( XJ" \ 

“ (cf. 

Theorem 4.1. of [2]) and the proof would be completed. 

We check (2.10). By the triangle inequality 


1 




k^ 

2=1 


U 


(i,n) 


u 


(fe+l,n) 


Ur,- 


(i,n) 


1 


U, 


(fc + l,n) 


exp 



- 1 


p < 0 imply that for n G N, /c = l,2,...,n and i = 1,2, 0 < 


u 


(i>n) 


u 


(fe + l,n) 


< 1. Thus 


0 < A„ < 


E 

i=l 


U 


(i,n) 


1 


u 


(k + l,n) 


exp 



(i,n.) e{x) 


dx — 1 


/U“^ X 

(fc+l,n) 


Now we have to show that the summands in the above expression converge 
in probability to zero for n ^ oo. The function T{x,y) = x.y is continuous 
in the point (x, y) = (1,1). In order to use the continuity of composition we 
have to check the following two convergences 
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and 

( 2 . 12 ) 


exp 



(i,n) e{x) 


dx 


(k + l,n) 


n —)• oo. 


The function c : R~^ —)• R~^ is such that c{x) cq £ (0, oo) as x —)• oo. 
Recall X go. Then and 0 as n —)• 00 . 

n 

Thus, (2.11) follows by continuity of g{x,y) = | in {x,y) = (co,co) and the 
Slutsky theorem (about the continuity in probability of the composition). 
Consider (2.12). It is enough to prove that 


rU 


(i.o e{x) 


dx —>■ 0. 


iv 


(fc + l,Tl) 


By 00 we have and 0 as n —)• 00 . 

n 

In view of Karamata-representation for regularly varying functions £{t) —)• 0 
as t —)• 00 . Consequently a.s. for eo > 0 there exists E N, such that for 

n > Heo, 


U 


£{x) , I . I 1 I I 

/ - dx\<£o\ -da:|=eo|ln 

'Uhi, 1 ® ^ ^ ^ 

(A: + l,n) (fc + l,n) 


(fc+l,n) 

{i,n) 


Let e > 0. By (2.4) 


0<P(| >e) 


(fe + l,n) 


< P(eo I In I > e) < P(£o I In U(,,fc)| > e). 

^ (2,n) 

The random variable In U(j does not depend on n and it is a.s. finite, 
hence for eq 0 we obtain that (2.12) is satisfied and we complete the 
proof. 

□ 


2.3. Diagnostic plots for fixed k. For applications it is very important to 
consider the behavior of these estimators for fixed sample size n and fixed 
number of order statistics k. A very frequently used example, that shows the 
disadvantages and mainly the slow rate of convergence of the Hill estimator 
is the distribution with the following quantile function 

F^ip) =-(1-P)"^^"ln(l-p), p£ (0,1). 


(2.13) 
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See [9]. We call this distribution Hill-horror distribution with parameter 
a > 0. Briefly F S We can determine easily U{n) = — 

1 /n) = n^/"lnn and this distribution satisfies the 2nd order regular varia¬ 
tion condition with A{t) = 1/ In t and p = 0. 

[9] show that in that case it is almost impossible to determine a using 
Hill plot. Therefore they call this plot ’’Hill horror plot”. 

In the next figures we show that when fix k ’’sufficiently big” and let 
n > k to infinity, using Proposition 2 we could determine a with reasonably 
small error also for this Hill horror distribution. We suppose that we do not 
know the exact distribution therefore we do not use the whole sample, but 
only follow the algorithm that allows us to use the above theorems. 

Example 4. a = 0.5. Here we simulate 1500 independent observations 
of a random variable with Hill horror distribution with parameter a = 0.5. 
It is well known that the values of this distribution fluctuate too much and 
their expectation does not exist. Then we chose the threshold in such a way 
in order to have enough observations for CLT ”to work”. In this case we 
chose the threshold 50 and obtain n = 189 observations above it. Having 
these observations we would like to estimate the tail of the distribution. The 
plot of the mean excess function built on these n = 189 exceedances is given 
on Figure 1 a). Further on we plotted the Hill estimator (lines and dots) and 
Generalized Hill estimator for p = —2 (lines) for fixed k = 30, fc = 60 and 
A; = 80 and different n = k + 1,..., 189. The straight line presents the true 
value 0/7 = 2 . The corresponding plots are given on Figures 1, b), c) and 
d). 

Example 5. a = 1. In this example we simulate 1500 independent ob¬ 
servations of HillH(l) random variable. The values of this distribution fluc¬ 
tuate less than in the previous example, but the mean still does not exist. We 
chose the threshold 10 in such a way in order to have enough observations 
for CLT ”to work” and obtain n = 181 observations above this threshold. 
Having these observations we would like to estimate the tail of the distribu¬ 
tion. The plot of the mean excess function built on these n = 181 exceedances 
is given on Figure 2 a). The plots of the Hill estimator (lines and dots) and 
Generalized Hill estimator for p = —1 (lines) for fixed k = 30, A; = 60 and 
A: = 80 and different n = k-\-l, ..., 181 are given on Figures 2, b), c) and d). 
The straight line presents the true value 0/7 = 1 . 

Example 6 . a = 2. The variance of the HillH(2) random variable pre¬ 
sented in this example does not exist, but the expectation exists. We simulate 
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b) alpha = 1/2, k = 30 




c) alpha = 1/2, k = 60 



d) alpha = 1/2, k = 80 



Fig 1. Figures to Example J^., cl = 0.5 
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b) alpha = 1, k = 30 



Fig 2. Figures to Example 5, a = 1. 
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b) alpha = 2, k = 30 



Fig 3. Figures to Example 6, a = 2. 


1500 independent observations of HillH(2) random variable and again we 
chose appropriate threshold in order to have enough observations for CLT 
to give relatively good approximation. Here the threshold is 3 and the number 
of the observations above it is n = 170. Having these observations we would 
like to estimate 7 = 1/a. The plot of the mean excess function built on these 
n = 170 exceedances is given on Figure 3, a). The Hill estimator (lines and 
dots) and Generalized Hill estimator for p = —0.5(lines) for fixed k = 30, 
/c = 60 and A: = 80 and different n = k + 1 ,..., 170 are given on Figure 3, b), 
c) and d). The straight line again presents the true value 0/7 = 0.5. 

Note: In these examples if we use the known form of the distribution, we 
do not need to chose any high threshold, in order to estimate 7 because 
the Hill horror distribution has Pareto tail. In that case we can also use the 
moment or other good estimators, however the situation is usually not such 
in practices. Therefore here we chose the ’’appropriate” threshold in order 
to follow the algorithm that is possible to apply to real data with unknown 
c.d.f. 
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d) alpha = 1/2, k = 80 


d) alpha = 1/2, k = 80 



Fig 4. a = 0.5. The Hill estimator (left) and Generalized Hill estimator, forp = —2(right), 
fixed k = 80 and different n. The straight line presents the true value 0 / 7 . 


d) alpha = 1/2, k = 80 


d) alpha = 1/2, k = 81 


H-1-:-1-:-1-H 

so 100 120 140 160 180 200 



Fig 5. a = 1. The Hill estimator (left) and Generalised Hill estimator, for p = —l(right), 
fixed A: = 80 and different n. The straight line presents the true value of y. 


Bootstrap techniques. Having in mind the above considerations, in this 
subsection we make 1500 simulations of independent observations of the 
random variable with the Hill horror distribution with the corresponding 
parameter a. Then we take 1000 different subsamples, without replacements 
and of sample size 1350 (90% of all observations), determine the threshold 
in such a way that to have 200 its exceedances and for k = 80 and for 
n = 81,82,..., 200 we calculate Hill and the Generalized Hill estimators 
for these samples, k and n. We repeat this procedure 1000 times. Having 
these estimators for fixed k and n we calculate the averages of the Hill and 
the corresponding Generalized Hill estimators and plot them by - 0 - line, 
take their minima (- -) and maxima (dotted line) and again plot them. The 
resulting plots are given on Figures 4, 5 and 8 . The real estimated value 7 
is given by straight line. 
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d) alpha = 1/2, k = 80 



d) alpha = 1/2, k = 80 



Eig 8 . a = 2, The Hill estimator (left) and Generalised Hill estimator, for p = 
—0.5(right), fixed fc = 80 and different n. The straight line presents the true value of 

7- 


Having this technique, using relatively small one initial sample of 1500 
observations we obtain relatively good estimators of 7 and a. 

3. Asymptotic normality of the Generalized Hill estimator. The 

asymptotic normality of the Hill estimator is investigated by many authors. 
See e.g. [13]. They prove asymptotic normality of the Hill estimator with¬ 
out the second order regularly varying condition but the number of order 
statistics that participate in the estimator goes to 00 in a very specific way. 
Most of the authors use the second order regularly varying condition in or¬ 
der to improve the rate of the convergence. Here we consider the asymptotic 
normality of the generalized Hill estimator (the Hill estimator could be 
considered as a particular case for p = 0) without the second order regularly 
varying condition. In order to apply the Central Limit Theorem(CLT) let 
us remind the numerical characteristics of the limiting distributions in the 
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previous theorem. Let U ~ C/(0,1), a > 0, 7 = ^ and p & R, then 
a) For s = 1, 2, sp < a, i.e. spj < 1, 


(3.1) EU-^P/^= [\-^P/'^dx = 

Jo 


^—sp/a+l 


a 


1 


—sp/a + V^ a — sp 1 — sp 7 


b) For a G R and s = 1, 2,..., 


(3.2) 

c) For 2p < a, i.e. P 7 < ^, 

(3.3) 


EhiU a = — = sj. 
a 


p2^2 


VarU-^'^ = EU-^'‘-(EUP'y = 2p,)(l - ' 


d) For a G R 
(3.4) 


—— 1 2 

Var mU “ = —= 7 


It is well known that the Pareto distribution does not satisfy the second 
order regularly varying condition. In the section 2 we proved that the distri¬ 
bution of the Generalized Hill estimator in this case coincides with the one of 
the transformed average of k i.i.d. Uniformly distributed random variables 
over the interval (0,1). Having in mind the CLT it is very natural to ob¬ 
tain that the distribution of the Generalized Hill estimator is asymptotically 
normal. In the following theorem we prove this result. 

Proposition 3. Let Xi,X 2 , ...jX^ be independent copies ofX. with d.f. 
F[x) = 1 — X > 1 and let $ he the standard normal d.f. Then 

1 . 

(3.5) lim lim —2)1 ^ x G M. 

k—>-oo n^oo 7 


2. for 2p < a, he. p 7 < i, 


'\/k{Hx,k,n,p 1 —p.y) 


(3.6) lim lim P{ -^-EJ12_ < x) = <l>(x), x G M. 


fc^cxD n—>-oo 


Vl-2p7(l-p7) 


3. for 2p < a, i.e. p'y < h, 


1- V ^3X,k,n,p 3 ) . \ \ _ 

lim hm Pi. - , ’ , - < x) = 4'(x), x G 

fc^oon^-oo 7(1-P7) 

Vl-2p7 


(3.7) 
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4-- for 2p < a, i.e. p'j < h, 


- - -Hx,n-l,n,p d 

Vn - 1-—- 


-P7 


>7V(0,1) 


(—P7+1)U—27P+1 


(3.8) 


\/l-2p7 


(3.9) lim i,n,o — t) ^ ^ ^ ^ 


7 


Proof: For the anyone of the following proofs we use Proposition 1 and 
the CLT. 

1. By (3.2) for s = 1 and (3.4) we have 

,. „,Vfe(7A,M,o-7) ^ ^ “ -7) ^ ^ ^ 

iim lim F[ - < x) = iim F[ - - - < x) = ^(x), x G 

k^oofi^Qo ‘y k^oo y 

2. By (3.1) for s = 1 and (3.3) we have 


VkiHx knv- T^) Vkii “ - j^) 

lim lim P{ -^ < x) = ^lim P{ -^^ < x) = <h(x), x G 


fc->cxD n—>-oo 


Vl-2p7(l-p7) 


Vl-2p7(l-p7) 


3. Here we consider the function h{y) = p “ p) with derivative h'{y) = 


py 


■, apply the delta method and obtain 


/ 


lim P{ 

k^oo 


y/k {^x,k,n,p - 7 ) 




< X 




= lim P{ 

k^oo 




1 - 




7 


< X 




= .hL ^ H = i-w. 

\ Vl-2p7 

4. According to Proposition 1 


^ n—1 

Hx,n-l,n,p = ^ ^ ^ 


-P7 


i=l 
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Let us now apply the CLTh 



(1-P7)V1-^P7 


We apply the delta method and obtain the desired result. 

In analogous way we prove the corresponding statement for the Hill esti¬ 
mator. 


□ 


Note: 1. Now we obtain the following confidence intervals for 7 having k 
and n large enough such that k « n, 



(3.10) 


2. The statement 4. shows that it is not obligatory k to be infinitely 
small function of n in order to obtain asymptotic normality without random 
centering and without second order regularly varying condition. 

3. Note that (3.5) is just a particular case of (3.7) and (3.9) is a particular 
case of (3.8) for p = 0. 

Further on we generalize these results for any distribution with regularly 
varying tail. 

Theorem 3.1. Let Xi,X2,...,X„ be independent copies of X. with d.f. 
F £ RV-a, o-nd let $ be the standard normal d.f. Then 

1 . 


(3.11) lim lim P{ 


k—>-oon^oo 


V^(7X,fc,n,0 - 7 ) 

7 


< x) = ^{x), X e M. 


2. 2p < a, i.e. p 7 < 


(3.12) lim lim P{ 


fc->oo n->oo 



\/l-2p7(l-P7) 


3. 2p < a, he. p 7 < |, 


(3.13) lim lim P{ 


'•/k {jx,k,n,p - 7 ) 


< x) = ^{x), X £ M. 


fc^cxD n^oo 


7(1-P7) 

\/l-2p7 
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Proof: For the anyone of the following proofs we use Proposition 2 and 
the CLT 

1. By (3.2) for s = 1 and (3.4) we have 

lim lim P( ^ < x) = lim lim P(7x knO < + t) = 

fc->OOn->0O 7 fc^00fl->00 ’ ’ ’ 


lim P(^ Vln Ui “ < x^+ 7 ) = lim Ei=i In 7 ) 

2. By (3.1) for s = 1 and (3.3) we have 

lim lim < ^) = 


< x) = 4>(x), 


k^oon—>-ao 


m 

\/l-2p7(l-P7) 
PI 


+ 


1 


= lim lim PiHx knv < x ^ ^ , 

k^oon^Qo ’ ’ ’ y/k^^/1 — 2p7(l — p'y) 1~P7 


1 ^ P 

lim P(- t/. “ < 

fc^-oo * 


X- 


P7 


1 


VfcVl - 2p7(l -P7) 1-P7 

= hm F[ -< x) = <I>(x), X e 


) = 


/c—)-oo 


P7 

Vl-2p7(l-P7) 


3. Here we consider the function h{y) ~ | ~ with derivative h'{y) = 


py 


apply the delta method and obtain 


lim P{ 

k^oo 


/ 

Vk {^X,k,n,p - 7 ) 

= lim P{ 

k^OQ 




/ 1 

1 1 p^7^ 

V\ 

1 


V ypH 

' 1 \4 (I 2p7)(l-P7)^ 

l-P~f J 


< X 


= hm P 

k—^oo 


Vk I 1- 




■7 


7(1-P7) 

\/l-2p7 


< X I = 4>(x), X G M. 


□ 


X G M. 


Note: Again 1. is just a particular case of 3. for p = 0. 
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4. How to find the most appropriate pi. In this section we discuss 
how to find the most appropriate p for the generalized Hill estimator. 

If we consider the smallest asymptotic variance of the Generalized Hill 
estimator, using Theorem 2., 3) we obtain that if p goes to 0 the variance 
goes to its minima which is 1. More precisely if we consider the function 


f{p) 


7(1 -P7) 
VI - 2p7’ 


for p < ^ ^ then 


fip) 


l^P 

\/(l - ‘^Plf 


The last means that this function has minima for p = 0. Due to the fact 
that this is the degenerate case we can only use p 0 . 

Now we use Berry-Esseen theorem in order to explain how p, together with 
the asymmetry of the distribution, influences the accuracy of the estimators. 
Let us first remind this theorem. 

Berry-Esseen theorem Let Xi,X 2 ,... be i.i.d. r.vs with EX = a, 
Var X = >0 and E\ = r < oo then there exists a positive constant 

C such that 


< I) - 4.(x) 


a 


< C 


n 


for all X G M and n G N. Here $ is the c.d.f. of the standard normal distri¬ 
bution. 


We would like to apply this theorem to the convergence of the sequence 
{\Y!i=iUi “,A; = 1,2,...|, where Ui,U 2 ,...,Ufc are i.i.d. uniformly dis¬ 
tributed r.v’s on (0,1) and p G M, to the standard normal distribution. In 
order to be able to make this we need to impose the restriction the third 
moment of “ to exist, that means that we will consider only p < ^ ^ 

and p V 0. In that case for 7 = I/a 


E 


Di “ - EU^ 


Vara 


1 


3 


2 V 1 - 2p7(l +pj) 
1 — 3p7 


The case of In could be considered again as a particular case of the 

above expression for p = 0 , because it is easy to calculate that 

3 


E 


In 


- E In Di “ 


Var In 


= 2 . 
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Let us now calculate the third moments of the absolute values. 
Case 1. p < 0. Denote by a = EUi “ and by cj^ = Var EU^ “. 

P(C/““ < EU^^) = {EU^^)~p = 


P(C/i “ > “) = 1 - {EU^ “)-f = 1 - 






VarU^ “ 


= £■ 





VarU^ “ 


“ V P(C/i “ > EU^ “)+ 


— E. —R 

EU^ “ - “ 



EU^ “ > C/^ “ > P{EU^ “ > C/^ “) = 




a > [/- 


1 


a 


-1/(P7) 


/o“ ' (« - + /aiM 


A 2 

■(1+2A)(1+A)^ 


)3/2 


2(1 - 2 p 7 )V 2 


PI 


+ l - Q{1 - ^ :=(/)(p, 7 ) 


1 — 3p7 

We used that A := —p 7 > 0,a := 0 < a < 1, and the fact, 

that for a random variable with density / and finite expectation we have 

E{x\a <x<b) = {f;:f{t)dt)-^f;;tf{t)dt. 

Case 2. p E (0, 1 /( 87 )) Using analogous computations we receive the same 
final formula as in the previous case. 

Case 3. p = 0, the Hill estimator. 


In -E In U 


1 


Var In U 


ip 


1 


12 

=-2 = lim (j){p, 7 ) « 2.4146. 

e p^o 


Note that this value does not depend on a. 

The function 4>{p,j) is plotted at range pj € (—8,0.3) at the Figure 9. 
So, we can conclude that if we minimize the variance of the estimators we 
should chose 


(4.1) 


PH = 0 
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and this corresponds to the Hill-estimator. If we would like to chose p in 
such a way in order to have smallest distance between the distributions and 
determine this closeness by Berry-Esseen theorem for fixed k, then we receive 
optimal p by 

(4.2) Popt{a) « -1.221a. 

Also from the Figure 9 we see that function (p{p, 7 ) is convex and that 
we obtain the interval —7.64 < pj < 0 for such p parameters which solve 
< '^( 0 , 7 ), where generalized Hill estimator is better that the Hill in 
the sense of Berry-Esseen. 

[19] works under Mason condition k{n) = o(n). Their estimator jn\k,r) 
coincides with Harmonic mean estimator of [1] . The relationship is I — p = 
/3 = 1 — r. In case of kn = o{n) and F £ 2RV-a,p, p < 0 [19] obtained for 
^n\k,r) an explicit (and rather simple) expressions for the optimal value 
of the parameter p. Their formula is 

(4.3) p- = 

27 

It is easy to check that using p* from formula (4.3) is not giving an 
optimal value of p for examples 1-3 neither in the sense of Berry Esseen 
approximation (see 4.2), nor the optimal variance for Hill-estimator (4.1), 
since it overestimates p. Indeed, we have 
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• for Example 1, for HW{1, —1) we have 

P* = 2 - ^ ~ 0-177 >PH> Popt{l) = -1.221. 

• for Example 1, for HW{2, —1) we have 

5 \/ 17 

= 0.438 > PH > Popt{2) = -2.442. 

• for Example 1, for HW{2, —1) we have 

\/l4 

P* = 2-^^ 0.129 >PH> Poptil) = -1.221. 

• for Example 2, we have p = 0,a = 1 and thus we obtain 

p* = 1-^^ 0.292 >PH> Poptil) = -1.221. 

Actually, it can be easily checked for (4.3) that p* > 0 for all a > 0, p < 0. 

5. Empirical investigation. Let us assume that the observed r.v. X 
has continuous theoretical c.d.f. F which is in the max-domain of attraction 
of some extreme value distribution with parameter 7 E M. From the The¬ 
orem 7 in [20] we see that this d.f. has generalized Pareto upper tail with 
parameter 7 . Due to the laws of the zero and once we can say that if we 
have independent observations of some random variable X and if we have 
enough data then the Pareto tail behavior will always appear in the data. 


In this part we show that the Pareto tail behavior could be observed in 
usual real data set with not too much observations and we do not need 
to check for the second order regular variation condition in order to use 
the asymptotic normality and to obtain confidence intervals of the index of 
regular variation. The data that we use here are taken from recent study 
of snow extremes in Slovakia (see [22]). The observed random variable k 
explains the ratio of the snow load to the characteristic snow load. In the 
spirit of the reproducible research and because of the sample size is only 
n = 41 we present also the data set. 
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The mean excess function of the data is given on Figure 10. It shows that 
appropriate choice of the threshold above which we can consider Pareto 
behavior of the data is u = 1.65. We observe that 18 observations exceed 
this threshold. 


Fig 10. Mean excess plot Fig 11. Generalized Hill and Hill plots 


K> 1.65, p =-0.1 



The Generalized Hill and Hill plots for p = —0.1, together with the Hill 
estimator and its confidence intervals, based on the normal approximation 
(see formula (3.10)), are given on Figure 11. The Hill estimator for k = 17 
delivered estimator 7 = 0.0829 for value of 7 and 0.95 conhdence interval is 
(0.0799,0.0865). Its Generalized Hill estimator for p = —0.1 is 0.0831. Now 
we can use the peaks over threshold technique for the estimation of the high 
quantiles and obtain that for x > 1.65 


P{K > x) 


X X-1/0.0829 18 

Tm) 41 


The last means e.g. that the level 2.5 will be exceeded approximately 2.4 
times in 1000 years (see [ 20 ]). 


6. Discussion and conclusions. In this paper we illustrated a flex¬ 
ible approach for extreme value modelling. In particular, we have proven 
asymptotic normality without 2 nd order regularly varying condition, suit¬ 
able for a small samples or complicated practical examples. We have also 
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illustrated theoretically that 2nd order regularly varying condition is not 
necessary for asymptotic normality. The alternative requirements on design 
for such samples are needed. Namely, we do not expect discovery of Pareto 
tail in arbitrary small amount of data e.g. less than 30 unless these data 
come from exact Pareto distribution. In the last case usually small amount 
of data, more than 30, may be enough because in such case the normal 
approximation works relatively well. We cannot apply generalized Hill es¬ 
timators when we have no RV tails. However, early or soon any subset of 
data that come from independent observations of r.v. with c.d.f. with regu¬ 
larly varying tail will show its Pareto tail behavior if we have enough data. 
From practical point of view, enough data means its mean excess function 
to become increasing from some point further on. In such case in order to 
estimate Pareto tail we take only the biggest observations. 
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